M Atrix Capsules with Em Routing
نویسندگان
چکیده
A capsule is a group of neurons whose outputs represent different properties of the same entity. Each layer in a capsule network contains many capsules. We describe a version of capsules in which each capsule has a logistic unit to represent the presence of an entity and a 4x4 matrix which could learn to represent the relationship between that entity and the viewer (the pose). A capsule in one layer votes for the pose matrix of many different capsules in the layer above by multiplying its own pose matrix by trainable viewpoint-invariant transformation matrices that could learn to represent part-whole relationships. Each of these votes is weighted by an assignment coefficient. These coefficients are iteratively updated for each image using the Expectation-Maximization algorithm such that the output of each capsule is routed to a capsule in the layer above that receives a cluster of similar votes. The transformation matrices are trained discriminatively by backpropagating through the unrolled iterations of EM between each pair of adjacent capsule layers. On the smallNORB benchmark, capsules reduce the number of test errors by 45% compared to the state-of-the-art. Capsules also show far more resistance to white box adversarial attacks than our baseline convolutional neural network.
منابع مشابه
M ATRIX CAPSULES WITH EM ROUTING Geoffrey Hinton
A capsule is a group of neurons whose outputs represent different properties of the same entity. Each layer in a capsule network contains many capsules. We describe a version of capsules in which each capsule has a logistic unit to represent the presence of an entity and a 4x4 matrix which could learn to represent the relationship between that entity and the viewer (the pose). A capsule in one ...
متن کاملOn the M(atrix)-model for M-theory on T
We study consistency conditions on a M(atrix)-model which would describe M-theory on T . We argue that there is a limit in moduli space for which it becomes a 6+1D theory and study the low-energy description of extended objects in the decompactified limit. We discuss the requirements from a M(atrix)-model which would describe such an E6(6) theory and we suggest that a 1+1D theory with (0, 4) su...
متن کاملA note on M(atrix) theory in seven dimensions with eight supercharges
We consider M(atrix) theory compactifications to seven dimensions with eight unbroken supersymmetries. We conjecture that both M(atrix) theory on K3 and Heterotic M(atrix) theory on T 3 are described by the same 5+1 dimensional theory with N = 2 supersymmetry broken to N = 1 by the orbifold projection. The emergence of the extra dimension follows from a recent result of Rozali (hep-th/9702136)....
متن کاملM(atrix) Theory of Anti-zero-branes
M(atrix) theory defines light-front description of the boosted M-theory along the eleventh Mcoordinate. Rank of M(atrix) gauge group U(N) is directly related to the momentum along M-coordinate P11 = N/R11 or, equivalently, the number of D0-partons. Alternatively, Mtheory may be boosted to opposite direction of eleventh M-coordinate. We argue that the corresponding M(atrix) theory is described v...
متن کاملWilson Lines and T - Duality in Heterotic M ( atrix ) Theory
We study the M(atrix) theory which describes the E8×E8 heterotic string compactified on S , or equivalently M-theory compactified on an orbifold (S/Z 2)×S, in the presence of a Wilson line. We formulate the corresponding M(atrix) gauge theory, which lives on a dual orbifold S × (S/Z 2). Thirty-two real chiral fermions must be introduced to cancel gauge anomalies. In the absence of an E8 × E8 Wi...
متن کامل